Automatically Adjusting Content Taxonomies for Hierarchical Classification

نویسندگان

  • Lei Tang
  • Jianping Zhang
  • Huan Liu
چکیده

Hierarchical models have been shown to be effective in content classification. However, the performance of the model heavily depends on the given hierarchical taxonomy. We empirically show that different taxonomies can result in significant differences in hierarchical classification performance. Motivated by some real application problems, we aim to modify a content taxonomy automatically for different applications. In this work, we formulate the problem, discuss why it is feasible to achieve better performance in terms of classification performance via adjusting a given hierarchy, and present one effective solution to find better hierarchies compared with that of the given original hierarchy. Preliminary experiments on some real world data sets are reported and discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments

A number of approaches for automatic audio classification are based on hierarchical taxonomies since it is acknowledged that improved performance can be thereby obtained. In this paper, we propose a new strategy to automatically acquire hierarchical taxonomies, using machine learning methods, which are expected to maximize the performance of subsequent classification. It is shown that the optim...

متن کامل

Automatic Construction of Taxonomies of Categories

Hierarchies are an intuitive and effective organization paradigm for data. Of late there has been considerable research on automatically learning a hierarchical organizations of data. In this paper, we formulate the problem of “automatically constructing hierarchical taxonomies”, which we define as learning a hierarchy of categories with no user defined parameters. We propose a framework that c...

متن کامل

Rapid Induction of Multiple Taxonomies for Enhanced Faceted Text Browsing

In this paper we present and compare two methodologies for rapidly inducing multiple subject-specific taxonomies from crawled data. The first method involves a sentence-level words co-occurrence frequency method for building the taxonomy, while the second involves the bootstrapping of a Word2Vec based algorithm with a directed crawler. We exploit the multilingual open-content directory of the W...

متن کامل

Acclimatizing Taxonomic Semantics for Hierarchical Content Classification

Hierarchical models have been shown to be effective in content classification. However, we observe through empirical study that the performance of a hierarchical model varies with given taxonomies; even a semantically sound taxonomy has potential to change its structure for better classification. By scrutinizing typical cases, we elucidate why a given semantics-based hierarchy does not work wel...

متن کامل

Notes on hierarchical ensemble methods for DAG-structured taxonomies

Several real problems ranging from text classification to computational biology are characterized by hierarchical multi-label classification tasks. Most of the methods presented in literature focused on tree-structured taxonomies, but only few on taxonomies structured according to a Directed Acyclic Graph (DAG). In this contribution novel classification ensemble algorithms for DAG-structured ta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006